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We analyze the effect of a heterogeneous variance on bump detection in a Gaussian 
regression model. To this end we allow for a simultaneous bump in the variance and 
specify its impact on the difficulty to detect the null signal against a single bump with 
known signal strength. This is done by calculating lower and upper bounds, both based 
on the likelihood ratio. 

Lower and upper bounds together lead to explicit characterizations of the detection 
boundary in several subregimes depending on the asymptotic behavior of the bump 
heights in mean and variance. In particular, we explicitly identify those regimes, where 
the additional information about a simultaneous bump in variance eases the detection 
problem for the signal. This effect is made explicit in the constant and / or the rate, 
appearing in the detection boundary. 

We also discuss the case of an unknown bump height and provide an adaptive test 
and some upper bounds in that case. 
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1. Introduction 


Assume that we observe random variables Y = (Yi,..., F„) 


through the regression model 


Yi — fXn 




i = 1, ...,n, 


( 1 ) 


where Zi Af (0,1) are observational errors, denotes the mean function and is the variance 
function sampled at n equidistant points i/n, say. The aim of this paper is to analyze the effect 
of a simultaneous change in the variance on the difficulty to detect = 0 against a mean of the 
form 


tJ-n {x) = {x) 


A„ if a: G 
0 otherwise, 


( 2 ) 


^Corresponding author 
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i.e. a bump with height A„ > 0 on an interval C [0,1] as location. The assumption that 
An > 0 is posed only for simplicity here, if A„ is negative or is considered, similar results can 
be obtained analogously. Further, in model Q we assume a simultaneous change of the variance 
by 

= cTo +(x), xG[0,l] (3) 

with a “baseline” variance Uq > 0. Note that the additional change in the variance cr^ > 0 may 
only occur when the mean has a bump of size A„ on We assume throughout this paper that 
an and ctq are known, and that also the length |/„| of In is known. Note that as the position of 
In is unknown, both functions /x„ and A„ are (partially) unknown as well. Concerning A„ we will 
later distinguish between the cases that A„ is known (to which we will provide sharp results) and 
unknown. 

The model arising from Q-([^ with the above described parameters will be called the het¬ 
erogeneous bump regression (HBR) model, which is illustrated in Figure The aim of this 
paper is to provide some first insights for precise understanding of the effect of the heterogeneity 
on detection of the signal bump. 



0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 


Figure 1: The HBR model: Data together with parameters A„,(To and in 0-© from the 
HBR. Here A„ = 4, tTg = 1, cr^ = 4 and n = 512. 


1.1. Applications 

The HBR model may serve as a prototypical simplification of more complex situations, where 
many bumps occur and have to be identified. In fact, in many practical situations it is to be 
expected that the variance changes when a change of the mean is present; see e.g. |38j for a 
discussion of this in the context of CGH array analysis. Another example arises in econometrics 
and financial analysis, where it has been argued that price and volatility do jump together [ 3131 ]. 
The HBR model can also be viewed as a heterogeneous extension of the “needle in a haystack” 
problem for |/„| 0, i.e. to identify s small cluster of non-zero components in a high-dimensional 

multivariate vector m [3 [HI Hi [SO]- 

Finally we mention the detection of opening and closing states in ion channel recordings (see 
m and the references therein), when the so-called open channel noise is present which arises for 
large channels, e.g. porins [431148j . A small segment of a current recording of a porin in planar 
lipid bilayers performed in the Steinem lab (Institute of Organic and Biomolecular Chemistry 
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University of Gottingen) is depicted in Figure We find that the observations fluctuate around 
two main levels, which correspond to the open and closed state. It is also apparent that the 
variance changes together with the mean. 



Figure 2: Current recordings of a porin in picoampere (pA) sampled every 0.4 milliseconds. 


1.2. Detailed setting and scope of this paper 

Let us now formalize the considered testing problem. Let C [0,1], n S N a given sequence of 
intervals. W.l.o.g. assume in the following that /„:=!/ |/„| only takes values in the integers, i.e. 

S N for all n G N and define 

An-.= {[U-i)\In\,j\In\] \ I < J < Q ■ (4) 

Then for fixed n G N we want to test 

Ho ■■ fJ.n = 0, A„ = (To, CTo > 0 fixed 

against (5) 

Hi 3 In G An S.t. ^n — = (Jq “t" (Ty^l/^, (7n 0. 

Throughout the most of this paper we assume that the parameters A„,(To,(t„ and |/„| are 
known, only the position /„ of the bump is unknown. Note, that if the location of the bump 
was also known (and not only its width |/n|), then the whole problem reduces to a hypothesis 
test with simple alternative, i.e. the Neyman-Pearson test will be optimal. This situation may 
be viewed as a variation of the celebrated Behrens-Fisher problem (see e.g. [37]) to the situation 
when the variance changes with a mean change. 

The requirement that A„ is known will be relaxed later on. 

The main contribution of this paper is to precisely determine the improvement coming from the 
additional change in variance by determining the detection boundary for (§. We stress that this 
issue has not been addressed before to the authors best knowledge, and it has been completely 
unclear if and to what extend the additional change in variance adds information to the model so 
far. We will provide some partial answers to this. 

The testing problem (§ is in fact a simplihcation of the practically more relevant problem where 
the interval /„ in the alternative is not necessarily an element of An-, but can be arbitrary in [0,1] 
with the predefined length |/„|. Furthermore the parameters |/„|, (Tq and (t„ will also be unknown 
in most practical situations, although often for the in Section 1.1 mentioned channel recordings 
the variances and the conductance level A„ can be pre-estimated with high accuracy, and hence 
regarded as practically known. In fact, lack of knowledge of all these quantities leads to a much 
more complex problem and raises already a remarkable challenge for designing an efficient test, 
which is far above the scope of this paper. Instead, we aim for analyzing the amount of information 
contained in the most simple HBR model compared to the homogeneous case, and therefore we 
restrict ourselves to the simplified (but still quite complicated) situation ([^. Note that our lower 
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bounds for the detection boundary are also lower bounds for the general situation (although we 
speculate that then not sharp). 

Our results are achieved on the one hand by modifying a technique from Diimbgen & Spokoiny 
[22] providing lower bounds in the homogeneous model. We will generalize this to the case of a 
non-central chi-squared likelihood ratio which appears due to the change in variance. On the other 
hand we will analyze the likelihood ratio test for (§ which then provides upper bounds. Doing so 
we use a generalized deviation inequality for the weighted sum of non-central chi-squared random 
variables from [35] (see Appendix]^ for a discussion of this and related inequalities). The authors 
there also study a heterogeneous regression problem, but in a setting which is not related to bump 
detection. 

1.3. Analytical tools and literature review 

Minimax testing theory. In this paper, the analysis of the HER model is addressed in the context 
of minimax testing theory, and we refer e.g. to a series of papers by Ingster [^ and to Tsybakov 
m for the general methodology. Following this paradigm we aim for determining the detection 
boundary which marks the borderline between asymptotically detectable and undetectable signals 
of type (^. 

In the following we consider tests : M" —>■ {0,1}, where (E) = 0 means that we accept the 
hypothesis Hq and {Y) = 1 means that we reject the hypothesis. We denote the probability 
of the type I error by 


d ($„, (To) := Eho [$„ (>")] = Pffo (>") = 1) 

and say that the sequence of tests has asymptotic level a G (0,1) under Hq, if lim sup a ($„, erg) < 

n—¥oo 

a. The type II error is denoted by 


\^n \ ; ^0 ; ^n) ■ Slip (^) 

In^An 

and lim sup p |/„|, (Jq, <7n) is the asymptotic type II error of the sequence of tests under 

n—yoo 

hypothesis Hq and alternative determined by the parameters A„, |/„| ,ao and cr„. Below we 
will skip the dependence on the parameters to ease the presentation. 

Obviously, a fixed bump can be detected always with the sum of type I and II errors tending 
to 0 as n — > (X). Thus we are interested in asymptotically vanishing bumps, hence we want 
to investigate which asymptotic behavior of A„, (t„ and |/„| \ 0 implies the following twofold 
condition for a fixed o; > 0 (cf. [34]h 

a) lower detection bound: let consist of faster vanishing bumps. Then for any test with 
limsupd ($„) < a it holds lim inf jd (d)„) > 1 — a. 

n—^QO n—>-oo 

b) upper detection bound: let consist of slower vanishing bumps. Then there is a sequence 
of tests with limsupd($„) < a and limsup/3($„) < a. 

n—yoo n—>-oo 

Typically the terms faster and slower are measured by changing the constant appearing in the 
detection boundary by an ±e„ term. If upper and lower bounds coincide (asymptotically), we 
speak of the (asymptotic) detection boundary, cf. ®> below. 

In fact we will see that for our specihe situation the obtained asymptotic behavior A„, (7„ and 
\In\ \ 0 does not depend on the specific choice oi a € (0,1). Consequently, we determine condi¬ 
tions for Hq and iJ" to be either consistently distinguishable or asymptotically indistinguishable 
in the spirit of Ingster (cf. [23]'). 
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Literature review. Motivated by examples from biology, medicine, economics and other applied 
sciences, methods for detecting changes or bumps in a sequence of homogeneous observations 
have been studied extensively in statistics and related fields. We refer to Csorgo & Horvath [19], 
Siegmund [IS], Carlstein et al. [TB] or Frick et al. |53| for a survey. Estimating a piecewise 
constant function in this setting consists on the one hand in a model selection problem (namely 
estimating the number of change points) and on the other hand in a classical estimation problem 
(estimating the locations and amplitudes). For the former, we refer to Yao [FTI, who proposed 
a method to estimate the total number of change points in the data by Schwarz’ criterion, and 
to Bai & Perron |4], who investigate a Wald-type test. For a more general approach to treat the 
change-point problem from a model selection perspective we refer to Birge & Massart [Sj , and for 
a combined approach with confidence statements and estimation, see Frick, Munk & Sieling |24] . 
Concerning the latter, Siegmund & Venkatraman |46j used a generalized likelihood ratio to estimate 
the location of a single change point in the data. The overall estimation problem has e.g. been 
tackled by Boysen et al. [12] via £o-penalized least-squares and by Harchaoui & Levy-Leduc [26] 
via a TV-penalized least-squares approach. Siegmund, Yakir & Zhang m proposed a specialized 
estimator for the situation of many aligned sequences when a change occurs in a given fraction of 
those. Recent developments include efficient algorithms for the overall estimation problem based 
on accelerated dynamic programming and optimal partitioning by Killick, Fearnhead & Eckley 
[33] and [24], on penalized or averaged log likelihood ratio statistics by Rivera & Walther [41], or 
Bayesian techniques as proposed by Du, Kao & Kou m- We also mention the recent paper by 
Goldenshluger, Juditski & Nemirovski [^, which has been discussed in this context [5U] . 

The special question of minimax testing in such a setting has also been addressed by several 
authors. We mention the seminal paper by Diimbgen & Spokoiny [^ and also Diimbgen & 
Walther [23] who studied multiscale testing in a more general setup, and [24] for jump detection 
in change point regression. Furthermore, we refer to Chen & Walther [18] for an optimality 
consideration of the tests introduced in mi, and to a rigorous minimax approach for detecting 
sparse jump locations by Jeng, Cai & Li [32] ■ Nevertheless, all these papers only address the case 
of a homogeneous variance, i.e. A„ = cto- There, it is well-known that the detection boundary is 
determined by the equations 

\/n |/„|A„ = v^cro\/-log(|/„|), (6) 

and the terms “faster” and “slower” in the above definition are expressed by replacing V^cro by 
V^ao ± Sn with a sequence (e„)„gpj such that e„ —)■ 0, log (|/„|) —t oo as n —>• oo (for details 

see [33]1. 

In the heterogeneous model and the connected testing problem ([^ mainly two situations 

have to be distinguished, see also [36] . 

The first situation emerges when it is not known whether the variance changes (but it might). 
Then as in ^ we explicitly admit tr^ = 0 and the variance is a nuisance parameter rather than an 
informative part of the model for identification of the bump in the mean. Hence, no improvement 
of the bound (|^ for the case of homogeneous variance is to be expected. The estimation of the 
corresponding parameters has also been considered in the literature. Huang & Chang [28] consider 
a least-squares type estimator, and Braun, Braun & Muller m combine quasilikelihood methods 
with Schwarz’ criterion for the model selection step. We also mention Arlot & Celisse [3] for an 
approach based on cross-valiation and Boutahar m for a CUSUM-type test in a slightly different 
setup. Even though, a rigorous minimax testing theory remains elusive and will be postponed to 
further investigations. 

The second situation, which we treat in this paper, is different: We will always assume > 0 (as 
in the examples in Section 1.1), which potentially provides extra information on the location of the 
bump and detection of a change might be improved. This is not the case in the situation discussed 
above, as the possibility of a change in variance can only complicate the testing procedure, whereas 
cr^ > 0 gives rise to a second source of information (the variance) for signal detection. The central 
question is: Does the detection boundary improve due to the additional information and if so, 
how much? 
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Relation to inhomogeneous mixture models. Obviously, the HBR model is related to a Gaussian 
mixture model 

r, ~(l-£)AA(0,a2)+eAA(A,a2), * = l,...,n, (7) 

which has been introduced to analyze heterogeneity in a linear model with a different focus, see 
e.g. min HU. For e ^ n ^(0</3<l) different (asymptotic) regimes occur. If /3 > 1/2, the 
non-null effects in the model are sparse, which leads to a different behavior as if they are dense 
(/3 < 1/2). Although it is not possible to relate Q with the HBR model in general, it is insightful 
to highlight some distinctions and commonalities in certain scenarios (for the homogeneous case 
see the discussion part of [H])- ^ main difference to our model is that the non-null effects do not 
have to be clustered on an interval It is exactly this clustering, which provides the additional 
amount of information due to the variance heterogeneity in the HBR model. A further difference 
is that the definition of an i.i.d. mixture as in Q intrinsically relates variance and expectation of 
the Yi. 


1.4. Organization of the paper 

The paper is organized as follows: In the following Section we present our findings of the 
detection boundary. The methodology and the corresponding general results are stated in Section 
[U Afterwards we derive lower bounds in Section and upper bounds in Section There we also 
discuss upper bounds for the case that A„ is unknown and provide a likelihood ratio test which 
adapts to this situation. In Section]^ we present some simulations and compare the finite sample 
power of the non-adaptive test and the adaptive test. Finally we will discuss some open questions 
in Section To ease the presentation all proofs will be postponed to the Appendix. 

2. Overview of results 

Throughout the following we need notations for asymptotic inequalities. For two sequences (a„)„gj,j 
and (6n)„gjj, we say that a„ is asymptotically less or equal to and write a„ ^ if there exists 
N € N such that < bn for all n > N. Similarly we define an ^ bn- If an/bn —> c as n —> oo for 
some c S M \ {0} we write an ^ bn- If c = 1 we write an^^bn- We use the terminology to say that 
an and bn have the same asymptotic rate (but probably different constant) if an bn, and that 
they have the same rate and constant if an ^ bn- Consequently, the relation in § determining 
the detection boundary in the homogeneous case becomes 

\/n |/„|A„ X y2cro\/-log(|/„|). 

The notations ^ and x coincide with standard notations. The definitions of and Y can be 
seen as extensions of the classical minimax notations by Ingster [21] or Diimbgen & Spokoiny [22] , 
where the null hypothesis is tested against the complement of a ball around 0 with varying radius 
p = p(n). The results are typically presented by stating that if p{n) tends to 0 faster than the 
boundary function p* (n), or if their ratio converges to some constant < 1, then the null and the 
alternative hypotheses are indistinguishable. 

In our setting, the asymptotic conditions are defined in terms of sums rather than ratios. Thus 
we introduced and Y to have similar notations. 

In the HBR model, our signal has a change in mean determined by A„ and also a change in its 
variance, which will be described by the parameter 

:= — > 0 . 

CTO 

In fact, we will see that the detection boundary is effectively determined by the ratio of and 
A„, which leads to three different regimes: 
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DMR Dominant mean regime. If vanishes faster than A„, the mean will asymptotically 
dominate the testing problem. In this case, the additional information will asymptotically 
vanish too fast, so that we expect the same undetectable signals as for the homogeneous 
case. The dominant mean regime consists of all cases in which k^/A„ —>■ 0, n —>■ oo. 

ER Equilibrium regime. If has a similar asymptotic behavior as A„, we expect a gain from 
the additional information. The equilibrium regime consists of all cases in which —)■ 0, 
A„ 0 and c := lim„_).oo Kn/(^n/o'o) = lim„^oo o-^/(A„cro) satisfies 0 < c < oo. 

DVR Dominant variance regime. If A„ vanishes faster than the variance will asymptoti¬ 
cally dominate the testing problem. In this case, we expect the same detection boundary as 
for the case of testing for a jump in variance only. The dominant variance regime consists 
of all cases in which 0 and k^/A„ —>■ oo, n —>■ oo. 


With this notation, we can collect the most important results of this paper in the following 
Table [2 The corresponding detection boundaries in different regimes are also illustrated in Figure 




Table 1: Main results of the paper. 



rate 

constant 

lower bound 

upper bound 

An known 

An unknown 

DMR 

V^n|4|A„ ~ log (141) 

\/2ao - 
Thm. 

- e, 

4.1 

1 

V^ao - 
Thm. 

■ £n 

5.1 


- 

Thm. 

^ £n 

5.4 


ER 

^n|4|A„ ~ A/-log(|4|) 

Thm. 

7^ 

4.2 


Thm. 

7^ ^ 
5.2 

■ Sn 

c+\/2+3 

^0 l+c2 

Thm. 

- 4- 

5.4 

£n 

\/n|/„|K^ ~ \/-log(|4|) 

cf. (2 

— 

4) 

-n 

2\/ 2^ 
analog to (2 

n 

4) 

l+c^ 

analog to (2 

4) 

DVR 

^n\In\Ki ~ ^-logim) 

2-£ 

Thm. 

-n 

4.3 


2 + £ 

Thm. 

n 

5.3 


1 + v^ 
Thm. 

+ s 

5.4 

a 


Here (e„) is any sequence such that —>■ 0 ,£„a/— log (|/„|) —>■ oo. 


The second column depicts the rates obtained in the different regimes, the columns three to five 
give the constants in different situations. ER is stated twice, because the results are shown w.r.t. 
the different rates from DMR and DVR respectively. 

Exemplary, the lower bound entry in the DMR denotes that signals are no longer detectable if 
^/n\I^\An ^ (v^cTo — £n) a/— log (|/n|)- Vice versa, the upper bound entry in the DMR means 
that signals are detectable as soon as ^/nJlnjAn ^ (v^co + Sn) -y/— log (|/n|), no matter if A„ is 
known or needs to be estimated from the data. 


A„ known. It can readily be seen from Tablethat the lower and the upper bounds with known 
A„ coincide up to the ±£„ term in all regimes. This directly implies that the detection boundaries 
are determined by these constants, which we will describe in more detail now. 
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DMR: A comparison of the lower and upper bounds in Table yields that the detection 
boundary is given by 

\/n |/„|A„ X V2cro\/-log(|/n|)- (8) 

Hence, the detection boundary in the dominant mean regime coincides with the detection boundary 
in the homogeneous model (cf. §)■ More precisely, if the additional information about a jump 
in the variance vanishes faster than the jump A„ in mean, we cannot profit from this information 
asymptotically. 

ER: It follows similarly that the detection boundary is given by 

\/n|/„|A„ X log(|/„|), (9) 

where c = lim„_>.oo cr^/A„. The characterization shows that we do always profit from 
the additional information in the ER case as 2 +c'^ < 1 whenever c > 0. Compared to the 
homogeneous model or the dominant mean regime the constant improves but the rate stays the 
same. Note, that the improvement can be quite substantial, e.g. if c = 1, which amounts to the 
same magnitude of (t„ and A„, the additional bump in the variance leads to a reduction of 33% 
sample size to achieve the same power compared to homogeneous bump detection. If c = 2, we 
obtain a reduction of 66%. 

As A„ and are of the same order in the ER, we may replace A„ by in ([^ and obtain the 
equivalent characterization 


\/n\In 


2 + c- 


V- l0g(|/n|)- 


( 10 ) 


This formulation allows for a comparison with the DVR below and is also depicted in Table 
second ER row. 

DVR: We find from entries three and four in the DVR row of Table [T] that the detection 
boundary is given by 

X 2a/- log(|/„|). (11) 

If the jump in variance asymptotically dominates the jump in mean, we exactly obtain the detection 
boundary for a jump in variance only. This is a natural analog to Q and coincides with the findings 
in the literature, cf. [TH Sect. 2.8.7.]. 

Finally note that if c in (10) tends to cx), we end up with In this spirit a constant c < 00 
can be also seen as an improvement over the pure DVR where the rate stays the same but the 
constant decreases, which is an analog to the comparison of the DMR and the ER, see above. A 
summary of the obtained detection boundary statements is illustrated in Figure]^ 


A„ unknown. In case that A„ is unknown, it has to be estimated from the data Y and the 
corresponding likelihood ratio test will be introduced in Section 4.4. The change in the considered 
statistics leads to different upper bounds. It is readily seen from Table that the obtained upper 
bounds do not coincide with the lower bounds in all regimes. 

DMR: It follows from the last entry in the DMR row that ([^ also characterizes the adaptive 
detection boundary for unknown A„. 

ER: If the parameter A„ is unknown, it can be seen from the last entry in the ER row that 
it is unclear if the lower bound remains sharp. We do not have an adaptive lower bound which 
would allow for an explicit statement here. 

DVR: In this regime it is again unclear if the lower bound is also sharp in case that A„ is 
unknown (again we do not have an adaptive one), but we consider it likely that the obtained 
upper bound 1 + -/S (cf. the last entry in the DVR row) is sharp. 
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DMR 


ER 


DVR 


-C, cf. @ 


V^CTo 


^Jn 

In 

- C^-ln( 

In\) 



- C^/-\n{\In\) 


C, cf. (10)-^ 


Figure 3: Illustration of the detection boundaries (§, (§, ( [To| ) and ( [Tl] ) in the different regimes. 

The dashed green lines illustrate the phase transitions between the DMR, ER and ER, 
DVR respectively, whereas the dashed blue line depicts the phase transition between the 
rates for DVR and DMR. Note, that on the left-hand side the x-axis corresponds to c, 
whereas on the right-hand side x- and y-axis are inverted and the x-axis belongs to 1/c. 


This loss of a constant can be interpreted as the price for adaptation and is quantified by the 
ratio r (c) of (331 and (25), (34) and (26) and (35) and (28) respectively. The ratios are given by 


r(c) 


1 


V2+c2(c-|-V2+3c2) 

2(l+c2) 

i+Vs 

2 


DMR, c = 0, 
ER, 0 < c < 00 , 
DVR, c = 00 , 


which are displayed in Figure]^ Note that r is a continuous function. Remarkably, the price for 
adaptation is never larger then as r attains its maximum at c = such that r (v^) = V^. 
As c —>■ oo, r tends to (1 -f •\/3)/2, as c —>■ 0, it tends to 1, meaning that adaptation has no cost at 
all in this situation. Note, that this is in line with findings for the homogeneous Gaussian model 

m- 


r (c) 



Figure 4: The price of adaptation r plotted against the parameter 0 < c < oo. 

If we relax the regimes ER and DVR by allowing for cr„ —> tr > 0, things become more com¬ 
plicated. We will prove lower bounds for those cases as well, which include logarithmic terms in 
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K = liin„^oo Kn (cf. Theorems 4.2 and 4.3 and values of C in (21b) and (22b) respectively). Our 
general methodology (cf. Theorem 3.3) will also allow for upper bounds in these situations (cf. 
^ and (!§), but it is clear that they cannot be optimal for the following simple reason: The 
underlying deviation inequality does not include logarithmic terms in k, neither do any from the 
literature (cf. Appendix [A|. 


3. General methodological results 

Fixing In in (§, we obtain the likelihood ratio by straightforward calculations as 
L„(A„,/„,K„;r) = (k^ + l) ^ expj ^ 


^ ^ + 2r,A„ - A2 




2 (1 + ^ 2 ) 0-2 


= Kr. 


!)■ 


exp 


2(1 + kI) erg 


S (« + !?) - 




2(7^ kI 


( 12 ) 


For the general testing problem ([^ where only the size |/„| is known but the location /„ not, 
the likelihood ratio function is then given by sup/^g^^ (A„, Y). 


3.1. Lower bounds 


Let be any test with asymptotic significance level a under the null-hypothesis ^ = 0, A„ = (Tq- 
We will now determine classes S := (S'„)„gf^ of bump functions, such that is not able to 
differentiate between the null hypothesis /i = 0, A„ = CTo and functions (/j,„, A„) G S'„ with type II 
error < I — a. To this end, we construct a sequence S such that 

d ($„) < a + o(l) +> 1 - ,0 ($„) - a < o (1), (13) 


which is equivalent. For such classes S we say that S is undetectable. 

In the following, the sequence S will always be characterized by asymptotic requirements on 
A„, Kn and |/„|. To keep the notation as simple as possible, only the asymptotic requirements will 
be stated below, meaning that the sequence S 

Sn = l+n = A^l/^, Xn = ctq + | A„, |/„| , Satisfy the specified requirements} 

is undetectable. 

To prove lower bounds we use the following estimate, which has been employed in |22l [24] as 
well: 

Lemma 3.1. Assume that 0-® hold true and An is given 6j/ (|^ . If is a test with asymptotic 
level a G (0,1) under the null-hypothesis /i = 0, i.e. a (4)„) < a + o (1), then 


1-/3 ($„) - a< E^=o,A=cro 



^ ^ Yn (A^i, In^ rin-, T) 1 


+ o(l), 


as n —>■ oo. 


(14) 


Recall that we want to construct S such that the right-hand side of (14) tends to 0. This can be 
achieved (as in [521 Lemma 6.2]) by the weak law of large numbers. By controlling the moments 
of Ln (A„,K„, y), this leads to the following theorem characterizing undetectable sets S'. 


Theorem 3.2. Assume the HER model is valid with known parameters A„,k„ and |/„|. The 
sequence S determining the asymptotic behavior of An, Kn and |/„| is undetectable, if \In\ \i 0 
and there exists a sequence Sn > 0, satisfying Sn < I/k^ such that for n —>■ oo, 


n\In\Al (l+(5„)(5„ 
2al I -SnK^ 


r ^ \^n\ , 2\ 


log (1 - SnKl) + Sn log (|/„|) -OO. 


(15) 
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3.2. Upper bounds 


To construct upper bounds we will now consider the likelihood ratio test. Recall that |/„| is known, 
but the true location of the bump is unknown. Motivated by (121 it is determined by 


^Y) := sup 4 E + 


(16) 


as test statistic. For simplicity of presentation, we will just write Tn for the statistic in (161 in the 
following and drop the dependence on the parameters. 

Furthermore let 

(1 if r„ (F) > c* 

0 else 


(F) := 


(17) 


be the corresponding test where c* „ G K is determined by the test level. 

In the following we will be able to analyze the likelihood ratio test ([l7|) in DMR, ER and 
DVR with the help of Lemma |A.1| For the relaxed situations where 74 0 we will prove lower 
bounds including logarithmic terms in Theorems |4.2| and |4.3[ Upper bounds including logarithmic 
terms cannot be obtained by the deviation inequalities in Lemma |A.l[ and we furthermore found 
no deviation inequality including logarithmic terms in the literature. Even worse we are not in 
position to prove such an inequality here, and thus we will not be able to provide upper bounds 
which coincide with the lower ones in the relaxed regimes. 

Now we are able to present the main theorem of this Section: 

Theorem 3.3. Assume the HBR model with |/„| 0, cr„ > 0 holds true and let Hq, he as in 

([^. Let a G (0,1) he a given significance level and 


^a,n — '^\In 


rr2/<-4 


-21og(a|/„|) + 2Wn|/„| 1 + 2 


A2 


log 


Q: I ^n. 


(18) 


Assume furthermore that A„, |/„| and Kn satisfy the following condition: 


n I in 


AS 


+ 


> 2 kS log ( Er 
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+ 2 k„ log - 



(19) 



n |/„ 


+ 2 (1 + kS) ^ I 
4 / 




log 


Then the test with the statistic given Tn defined in ( |16[ ) and the threshold ( |18[ ) satisfies 
limsupd (4)„) < q; and limsup/3 ($„) < a. 


This theorem allows us to analyze the upper bounds obtained by the likelihood ratio test in the 
regimes DMR, ER and DVR. 

Remark 3.4. Suppose for a second that not only the width |/„|, but also the location In of the 
bump is known. In this case, the alternative becomes simple, and the analyzed likelihood ratio test 
will he optimal as it is the Neyman-Pearson test. It can readily he seen from the proofs that all 
log (I/„I)-terms vanish in this situation, whereas the other expressions stay the same. Thus our 
analysis will also determine the detection boundary in this case. 


11 


















4. Lower bounds for the detection boundary 

We will now determine lower bounds in the three different regimes by analyzing (15). 


4.1. Dominant mean regime (DMR) 

As mentioned before, we expect the same lower bounds as for the homogeneous situation here. 

Theorem 4.1. Assume the HBR model with |/„| \ 0 and let (sn) be any sequence such that 
Sn 0,e„v^-log(|/„|) —)■ oo. 

1. If a^/An —>■ 0 and tr^ = o(£n) as n oo, then the sequence S with 

\/n |/„|A„ ^V2ao - e„) a/- log(|/„|) (20) 

is undetectable. 


2. If tT^/A„ —>■ 0 where cr^ = tr^ (1 + o (£„)) and 1/A^ 
S with (20) is undetectable. 


o(£n) as n ^ OO, then the sequence 


4.2. Equilibrium regime (ER) 

Now let us consider the case that af, and A„ are asymptotically of the same order. Consequently 
and A„ will be of the same order. In this situation we expect a gain by the additional information 
coming from cr^ > 0. In fact, the following Theorem states that the constant in the detection 
boundary changes, but the detection rate stays the same: 

Theorem 4.2. Assume the HBR model with |/„| 0 and let (sn) be any sequence such that 

£n 0,£„a/- l0g(|4|) -)► OO. 

1. Let cr^ = caoAn (1 + o (£«)); c > 0 and af, = o (£„) as n ^ oo. Then the sequence S with 


y/n\In\An ;< (C - £„) y/- \og {\In\), C := (21a) 

as n ^ oo is undetectable. 

2. If af, = cr^(l + o(£„)) and A„ = ^(l + o(£n)) as n ^ oo, then with k := cr/tJo the 
sequence S with 

^M^:<{C-£,,)^/-\og{\Ir,\), C:= ^ = (21b) 

y ^ (^2 + c2) - 1 log (1 + k 2 ) 

as n ^ oo is undetectable. 


In the first case, for c = 0, we have C = V^cro, which corresponds to no change in the variance 
and hence reduces to the homogeneous model. But if c > 0, we always obtain C < V^ctq, more 

precisely we improve by the multiplicative factor 

,- 1/2 


As C = 


(l? i log (1 + ^2)) 


is not a multiple of \/2aQ, the gain in the second 
case (21b) is not that obvious. But using log (l + k^) < with equality if and only if k = 0 
implies that C < y/^ao with equality if and only if k = 0. 

Finally, if (T„ —> 0, then by a Taylor expansion it follows that 






+ O (cr® ) , cr„ 


and hence (21b) will reduce to (21a) in this situation. 
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4.3. Dominant variance regime (DVR) 

Now let us consider the case that the jump in the mean A„ vanishes faster than the jump in 
the variance cr^. In this situation we again expect a further gain by the additional information. 
Somewhat surprisingly, we will even obtain a gain in the detection rate compared to the ER in 

(IT). 

Theorem 4.3. Assume the HBR model with |/„| \ 0 and let (sn) be any sequence such that 
Sn 0,e„v^-log(|/„|) —)■ oo. 

1. IfaoAn = cr'^On with sequences A„, cr„, —)■ 0 as n —)■ oo where = o (£„) and 9^ = o {€„), 

then the sequence S with 

i/n |/„|A„ (2 (To - £n) \/- log(|/„|)6»„ (22a) 

is undetectable. 


2. // cr„ = cr (1 + o (£rt)) and A^ = o (Sn) as n —> oo, then the sequence S with 


\/^;^ (C'-EnjV-logd/nl), C: = 


1 


^ - 5 log (1 + 


(22b) 


is undetectable. 


First note that (22a I can be equivalently formulated as 

\/n\In 


-< 


(2 - £„) v'-log(|/„|), 


(23) 


which is also meaningful if A„ = 0 and hence determines the detection boundary if only a jump 
in the variance with homogeneous mean ^ = 0 occurs. We want to emphasize that the exponent 


2 of Kn in (231 seems natural, as testing the variance for a change is equivalent to testing the 
mean of the transformed quantities {Yi — Anl/^ (i/'n)) (see [191 Sect. 2.8.7.]). Consequently, the 


detection rate improves compared to the DMR (20), as we assume k„/A„ —oo. 


Note that we can also rewrite (21a) in terms of the «;„-rate, which gives the equivalent expression 


V'>T'Un\Kl :<iC- Sn) 'J - log(|/„|). 


C := 2 


2 + b 


,2 ■ 


(24) 


This makes the ER comparable with the DVR. As expected C < 2, and thus we see a clear 


improvement in the constant over (23). If c —> oo, then C tends to 2 which coincides with (23). 
Again, if k —> 0 it can be seen by a Taylor expansion that (22b) reduces to (22a). Furthermore, 


(22b) can be seen as the natural extension of (21b) as c —)■ oo. 


4.4. Discussion 


The condition cr^ = o(£„) seems to be unavoidable as soon as a Taylor expansion (cf. (49c)) is 
about to be used. 

In several assumptions we require a convergence behavior of the form 1 + o (Sn). Note that this 
cannot readily be replaced by 1 + o (1), because the detection boundary is well-defined only up to 
a term of order e„. Thus if the convergence behavior is slower, at least the constants will need to 
change depending on the specific convergence behavior. 


5. Upper bounds for the detection boundary 


After determining lower bounds for the detection boundary, we now aim for proving that those 
are sharp in the following sense: There exists a level a test which detects with asymptotic type 


II error < a bump functions having asymptotic behavior of the form (20), (21) and (22) where 
(C — e„) on the right-hand side is replaced by {C + Sn). 
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5.1. Dominant mean regime (DMR) 

Let us first consider the case a'^jI\n 0 and = o (£„) as n —>■ oo. The following theorem states 
that our lower bounds are optimal by exploiting the likelihood ratio test: 

Theorem 5.1. Assume the HBR model with |/„| 0 and let Hq,Hi be defined by ([^. Suppose 

En > 0 is any sequence such that £„-\/— log (|/n|) —>■ oo as n —>■ oo and a G [0,1] is a given 
significance level. 

Furthermore, let 

1. either a'^jAn —>■ 0 and cr^ = o (£„) 

2. or an I An -G 0 and a^ = (1 + o (£«)) 


as n 


oo. If 


i/n |/„|A„ F (^V2ao + £n) V-log(|/„|), 
then the test 0 with Tn defined in ( [T^ and the threshold ( |I^ satisfies 
lim sup d (<!>„)< a and limsup^S ($„) < a. 


(25) 


5.2. Equilibrium regime (ER) 

Theorem 5.2. Assume the HBR model with |J„| \ 0 and let Ho,Hi be defined by ([^. Suppose 
En > 0 is any sequence such that En\/ — log (|dn|) -G oo as n ^ oo and a G [ 0 , 1 ] is a given 
significance level. 

Furthermore let cr^ = caoAn (1 + o (En)) with c > 0 and af, = o (En) as n ^ oo and 


y/n\In\An ^ (C + £„) V-log ([/„]), C := \/2ao\J~:^ 


(26) 


2 + c2' 

Then the test 0 with Tn defined in ( |16[ ) and the threshold ( |18[ ) satisfies 
lim sup d (<!>„)< a and limsup^S ($„) < o. 

n—>-oo n—>-oo 

Comparing Theorems |4.2| and |5.2| one may note that our upper bounds so far do not handle 
the case where does not tend to 0. In fact we can also obtain an upper bound from (19) for 
an = a^ (1 + o (£n)) and A„ = ^ (1 + o (En)) as n —>■ oo, i.e. 


a / 2«-2 + ^ + 1 + . / n - 

^fc(C + £„)x/-log ([/„]), C:=V - / V -(27) 

Obviously, this bound does not coincide with the lower bounds from Theorem |4. 2 1 This is due to 
the fact that our tail estimates for the distribution does not include any logarithmic terms, but 
our lower bounds do. Thus it is impossible to obtain the same bounds using Lemma |A.l| with our 
technique. 


5.3. Dominant variance regime (DVR) 


Theorem 5.3. Assume the HBR model with |J„| \ 0 and let Hq,H^ be defined by (§. Suppose 
En > 0 is any sequence such that En\/ — log (|dn|) -G oo as n ^ oo and a G [0,1] is a given 
significance level. 

Furthermore let ctoA„ = af,9n with sequences An, an, 0n -G 0 as n oo where af^ = o(En), 
0n = o (En) and 


^/n\In\An >: (2ao + En) a/- log(|/„|) 6 »„ 

Then the test 0 with Tn defined in (|16[) and the threshold (|18[) satisfies 


( 28 ) 


limsupa (<i)„) < a and limsup/3 ($„) < a. 

n—>-oo n—)-oo 
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Note again that we can also obtain upper bounds in case that (T„ = ct (1 + o(£n)) and = 
o (e„) as n —>■ oo, i.e. 

fc(C' + e„)v/-log{|/n|), C:= + (29) 

which again cannot coincide with the lower bounds from Theorem |4.3| for the same reason as in 
the equilibrium regime. 


5.4. Adaptation 

In this Section we will discuss an adaptive version of the likelihood ratio test for which A„ does 
not need to be known. In this case it is natural to replace A„ by its empirical version A„ := 


(n \In\) ^ This leads to the marginal likelihood ratio 


Ln{In,Kn;Y) = (kI + I) ^ exp 




n \In 


2o-g + 1) . * 2cr2 (^2 




= + 1 ) " exp 

and to its corresponding test statistic 

( 

(y) .= sup 


2 o -0 + 1 ) 




+ 


\ln 




2a§ 


AnSAr, 


O'o + 1 ) 


E ( 


1 ^;-A. 




\J-n\ 


E 


(30) 


i:-£A„ 


Again we will drop the dependence on the parameters subsequently and write T* := 


for the statistic in (301 to ease the presentation. Let 

'1 ifT*(r)>c* 


:= 


0 else 


(31) 


be the corresponding adaptive test where c* „ S M is determined by the test level a. An analysis 
of the likelihood ratio test (311 with T* as in (30) can be carried out similarly to the non-adaptive 
case. The main difference is that we now need Lemma [A. 1| with k = 2. Thus, the following result 
can be proven: 

Theorem 5.4. Assume the HBR model with |J„| \ 0 and let Hq,H^ be defined by (HI- Suppose 
Sn > 0 is any sequence such that en\/ — log (|dn|) -A 00 as n ^ 00 and a G [ 0 , 1 ] is a given 
significance level. Define the corresponding threshold by 


:=^^(n|/„|-l) + l + 2 , 
“T -L 


(n |/„| - 1 ) + 1 


log 




+ 2 log 


\ .(^n + 1 ) 

Now suppose that we are in one of the following three situations: 

DMR: k^/A„ —>■ 0 and 1/A„ = o{£n) as n —>■ 00 and 

i/n |/„|A„ ^ (y2ao + \/- log(|/„|). 

ER: af = cagAn (1 + o {Sn)) with c > 0 and af = o (e„) as n —>■ 00 and 

c + ^2 + 3c^ 


■\In\ 


(32) 


\/n |/„|A„ ^ (C + £n) \/- log(|/„|), c := (To- 


1 + 


(33) 


(34) 
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DVR: aoAn = with sequences A„,cr„,0„ —)■ 0 as n —>■ oo where cr^ = o(e„), 9n = o(en) and 
furthermore 

i/n |/„|A„ ^ (To + En) V- log(|/„|)6»„. (35) 


/n an?/ of these cases the test (31) with the statistic given defined in (30) and the threshold 
(32) satisfies 

lim sup a (<&*)< q; and limsup;S ($*) < a. 


The above theorem states upper bounds which are adaptive to the height A„ of the bump. It 
is remarkable that the adaptive upper bound coincides with the non-adaptive one (cf. Theorem 


5.1) in the dominant mean regime, where in the two other regimes the rates coincide but we lose 


a constant. 


6. Simulations 


In this Section we will examine the finite sample properties of the test discussed in Section by 
means of a simulation study. We therefore implemented the non-adaptive likelihood ratio test 
(cf. with the threshold c* ^ as in ( [T^ as well as the adaptive likelihood ratio test <I>* (cf. 
with c* „ as in (32). For these tests the levels a ($„), a ($*) and powers 1 — /3 ($„) ,1-/3 ($*) 
have been examined by 10^ simulation runs for a = 0.05 with different parameters (Tq, n, Kn, A„ 
and |/„|, respectively. 

All derived detection boundaries involve the ratio n | J„| / log (1/ |/„|). For the simulations below, 
we therefore hxed this ratio and then considered three situations, namely small sample size (|/n| = 
0.1, n = 829), medium sample size (|/„| = 0.05, n = 2157) and large sample size (|/„| = 0.025, 
n = 5312). In all three situations we computed the power for (Tq = 1, G {0.01, 0.02,..., 1.2} and 
A„ G (0.01,0.02,..., 0.7}. The corresponding results are shown in Figure]^ 


Power loss caused by adaptivity. By comparing the constants in Theorems |4.1[{4~3| and Theorem 
5.4 it can be seen (as discussed in Section]^ that the adaptive test is (asymptotically) at least 


as powerful as non-adaptive test if A„ and are multiplied by a factor of v^- By comparing 


subplots |5(b) I and 5(a) 5(d) and 5(c) and |5(f)| and 5(e) respectively, it can be seen roughly that 
this also holds true in the simulations. Interestingly, the shape of the function r from Figure 
becomes also visible as a small ’belly’ on the lower right in the adaptive large sample situation 
(subplot |5(f)[), i.e. A„ has initially to increase with to reach the same power. 


Improvement by heterogeneity. Let us consider an exemplary situation from the medium sample 
size case, i.e. n = 2157 and |/„| = 0.05. From subplot 5(c)| it can be seen that the empirical power 
for An = 0.2 increases dramatically when we alter from = 0 to = 0.5. For = 0 we 
use the standard likelihood ratio test for comparison, which is based on the statistic 


(Y) = 


1 

sup —2 

O’, 






Y, 


and uses the exact quantiles of Af (0, n |/„|) and the union bound to compute the threshold c* „. 
A comparison of the two situations is shown in Figure Remarkably, the non-adaptive test at 
level 5% is able to detect the bump when = 0.5, but not when = 0 for the depicted data. 
Note that the empirical power of the test in the heterogeneous situation is 47.1%, whereas the one 
of the test in the homogeneous situation is only 22.5% (note that this is larger than the power 
depicted in Figure]^ for = 0.01 due to the used exact quantiles for the homogeneous test). 
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A„ A„ 

(c) non-adaptive test power, medium sample case (d) adaptive test power, medium sample case 





1 

0.8 

0.6 

0.4 


0.2 
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(e) non-adaptive test power, large sample case 


(f) adaptive test power, large sample case 


Figure 5: Power of non-adaptive (known mean A„) and adaptive (A„ unknown) tests plotted for 
A„ vs. in three different situations (small sample size |/„| = 0.1, n = 829, medium 
sample size |J„| = 0.05, n = 2157, and large sample size |J„| = 0.025, n = 5312), each 
simulated by 10^ Monte Carlo experiments. 


7. Conclusions and future work 


Summary. We obtain asymptotic minimax testing rates for the problem of detecting a bump 
with unknown location /„ in a heterogeneous Gaussian regression model, the HER model. The 


lower bound condition (15) guides us for a distinction into three possible regimes in the HER 


model. It allows to quantify the influence of the presence of an additional information about the 
change in the variance on the detection rate. Both cases of known and unknown bump height A„ 
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~'^0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

(a) homogeneous bump detection - the likelihood ratio test cannot detect a bump 




Figure 6: Comparison of homogeneous and heterogeneous bump detection. We simulated data 
from the model (fTj) once with A„ = 1 (homogeneous) and once with A„(a;) = 
x/TT^ • l 7 „(a:) (heterogeneous) for the same errors 1 < * < n. Here n = 2157, 
In = [0.5,0.55] and A„ = 0.2. Even though the observations look quite similar in both 
situations, the non-adaptive test 0 with level 5% is able to detect a bump in the 
heterogeneous situation, but not in the homogeneous situation. 


are considered. In three possible regimes of the HER model the exact constants of the asymptotic 
testing risk are obtained for the case of known A„. In the case of adaptation to an unknown A„ 
the obtained rates remain the same as for the case of known A„. The optimal constant coincides in 
the DMR with the case for known A„, else it is unknown (see Tablefor details). The obtained 
results are based on non-asymptotic deviation inequalities for the type I and type II errors of 
testing. We would like to stress that the thresholds (181 and (32) of both tests are non-asymptotic 
in the sense that they guarantee the non-asymptotic significance level a. This allows to apply the 
proposed tests even for finite samples. We also provide non-asymptotic upper detection bound 
conditions (19) and (53). Lemma 3.1 on the lower bound estimate can be easily proven for the 
case of a non-asymptotic type I error as well. 


Open issues and extensions. Our results are incomplete concerning the situation if —)■ k > 0 
in the equilibrium and the dominant variance regime. We applied the same techniques to prove 
lower bounds in this case, and we conjecture that these lower bounds are optimal, i.e. determine 
the detection boundary. We were not able to prove coinciding upper bounds due to limitations 
in the used deviation inequalities. To do so we would need deviation inequalities for random 
variables involving the logarithmic terms from the moments, but such a result is beyond the scope 
of this paper. 

Furthermore, we only investigated adaptation to A„, but it would also be interesting to see what 
happens if the “baseline” variance cto or / and cr„ is unknown. By using the empirical variance on 
/„ as an estimator, this would lead to a likelihood ratio test with a test statistic involving fourth 
powers of Gaussian random variables. For the analysis good tail inequalities for such random 
variables are on demand. 

Additionally, one could also ask for adaptation w.r.t. |/„| and corresponding lower bounds, but 
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this is far beyond the scope of this paper and requires further research. See, however, |24j for the 
homogeneous case. 

Another important open question is concerned with the situation of multiple jumps. In the 
homogeneous case this has been addressed in [21], and upper bounds with constants 4 for a 
bounded and 12 for an unbounded number of change points have been proven. Even though it 
seems likely that these constants are optimal, no corresponding lower bounds have been proven 
so far. In the present HER model this is an open question as well. 

As stressed in the Introduction we currently only deal with the HER model, i.e. that a jump in 
mean also enforces a jump in variance (/«„ > 0). But often it is more reasonable to consider the 
situation that whenever a jump in mean happens, there can also be a jump in variance, but not 
conversely. This would be modeled by letting At„ € [0, oo). For simplicity one could restrict oneself 
to Kn G {0, Ki^n}- Still it is Unclear what the detection boundary should be in this situation. In 
fact it is not even clear if the additional uncertainty leads to a loss of information. 

We believe that lower bounds can be constructed in the same way (and it is likely that they 
stay the same), but the calculation of upper bounds seems quite more involved. 
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A. A chi-squared deviation inequality 

For the analysis of the likelihood ratio test and the adaptive version for unknown A„ in Subsection 
5.4 a specific deviation inequality for the weighted sum of two non-central distributed random 
variables is required. Recall that X ^ Xd with non-centrality parameter and d degrees of 
freedom, if 

d d 

X = ^ {^j -I- aj)^ where Af (0,1) and ^ aj. 

t=i i=i 

In this case E [A] = d + and V [X] = 2 (^d + 2a^). 

In the following we consider the weighted sum of k non-central chi-squared variables Z = 
where bi > 0 and Xi ^ Xd (®i) independent with G N, af > 0, * = 1, .. . , /c. 

Note that 

k k 

E[Z]='^b,{d, + a^^) and Y [Z] = 2^^ bf (d, + 2a^^). 

2=1 2=1 


Lemma A.l. Let Z = where bi > 0 and Xi ~ Xd (®i) independent with di G N, 

af >0,i = 
all X > 0: 


af > 0, i = 1,..., k. Let |i&||oo = maxi<i<fc \bi\. Then the following deviation bounds hold true for 


< exp (—x ), 

(36a) 

< exp {—x). 

(36b) 


The above Lemma is a reformulation of [351 Lemma 2]. 
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In the following we will briefly discuss other deviation inequalities for distributed random 
variables [3 0 [nii EH 03 Eii and their connections to ( |36[ ) from the perspective of our purpose. 
At first we mention an inequality by Birge [H Lemma 8.1], which deals with k = 1 and coincides 
with the above result in that case. Also related to ( p^ is Prop. 6 in Rohde & Diimbgen 

P (Z > E [Z] + V2V[Z]cr + 4||6||^x2 + 2\\b\\^x) < exp {—x), X > 0. (37) 

This differs from our bound by the additional 4||6||^a:^ term in the square root, revealing 

(36b) as strictly sharper. 

Other deviation results were proven for second order forms /„ (s) = + w'^s, s G 


of Gaussian vectors with B G G To relate this with our situation, let d = X)i=i 

choose any aj^i G K such that J2j=i all 1 < i < fc and set 

B := diag (^\/h ■ Idi, • Ida, Vh ■ Id^) , 

w := ( 6 iaix, &ia2,i, ^lOdiP, &2ai,2, &2ad2,2, i'aoi.a, • 

Here Im = (1,..., 1) € K™. Let Tr (A) be the trace of a matrix A. Then Tr (B^B) = Tr (B^) = 
ELi bzdi, Tr (b4) = b^d^, \\B^\\oo = ||6|loo and w'^w = ELi • This yields 


U (C) = ^ = Z - E [Z] + Tr (B^) , C ~ A7(0,/d) 


(38) 


i=l 


with Z as in Lemma lA.il 

A deviation inequality for the case w ^ Q can be found in the monograph by Ben-Tal et al. [3 
Prop. 4.5.10], which provides in our setting the bound 

^fw (C) > Tr(B2) + 2y^||B2||2a;2 + 2(Tr(B4) + 2||B2||a;^ < exp (-x) (39) 

for all a: > 0, where ||B^|| denotes the spectral norm of B^ given by the square root of the largest 
eigenvalue. Taking (38) into account, this yields 


P Z > E (Z] + 2^ 


A, (di+af)a; + ||6||^a:2 

\ i=i 


, a; < exp (—t), t > 0 


for Z as in Lemma A.l As 8^^^^ hf (di + of) > 2V ]Z], this inequality is strictly weaker than 
the Diimbgen-Rohde estimate (37) in the present setting. 

For the case ic = 0 (corresponding to centered random variables or = 0 for 1 < i < A: in 
our notation). Lemma A.l reduces to the deviation inequality in Hsu et al. |27j : 


(/o (C) > Tr(B2) + 2v/Tr(B4)x + 2 |!b2||^x) < exp (—x), X > 0. 


(40) 


The situation w = 0 has also been considered by Spokoiny & Zhilova [49], where the following 
inequality is proven: 

P (/o (C) > Tr(B^)-I-max |2-y/2^rr(iT^, 6||B^||ooa:|^ < exp (—a;), x>0 (41) 

Comparing (40) with (41) gives that ( [41] ) is sharper for moderate deviations, i.e. if 

^ Tr (B4) 1 Tr (B^) 

(3-2^2) <a:< T ^ ^ 


B2||^ - - 4 ||B2||^ 


for large and small deviations it is weaker. We are actually interested in large deviations, and 
furthermore the restriction = 0 makes (41) not applicable for our purpose. 
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B. Proofs of Section S] 


Proof of Lemma \3.1\ We follow 


I- P ($„) - a< inf [^‘n (i^)] - a 

{lJ^nAn)^Sn 

=^2+^2 1^^ [®n (!")] - a 

-~r X! ®Mr.=A„l/„,A2=cr2+cr2l^„ [‘i’ri (y)] - a 


InGAn 


-T~ X/ ®A‘^=A„l7„,A2=o-2+o-2l7„ [‘J’n (^) - E^=o [<&„ (F)]] + o(l) 




- ^ L„(A„,/„,K„,r)-iU„(y) 


Iri^Ar, 


■o(l) 


y 'y ^ Ln 1 

" /^G^„ 


'(1). 


as n —>■ oo. 


□ 


Proof of Theorem \3.^ By Lemma 3.1 we have to prove that the right-hand side of (14) tends to 
0 under the given conditions. 

Suppose that 


lim Eyi_ 0 ,A=Cro \f{\Ln{An,In,f^n,y) — ^\^C^n} | (Ayj , dyj , /vyj , l|j 0 


(42) 


for any ^ > 0. Then the weak law of large numbers for triangular arrays (the condition ( |42[ ) 
implies the conditions imposed in HH Ex. 2.2]) is applicable (as |/„| \ 0 and hence In Z' oo), 
i.e. TO“^ YT=i -t 1 in probability with m = and Zi = Ln (A„, [(i - 1 ) |/„| |/„|], k„,T). As 

> 0 we have 


E 


m 


2—1 
— 1 


-i m -.771 

= -^E[|Z,|] = -^E[Z,] = l 


for all m S N, 


i=l 


i=l 


which then implies —t 1 in expectation, i.e. (14) tends to 0. 

The only thing left to prove is that (42) holds true under the imposed conditions. To show this 
we will use the moments of L„ = L„ (A„,/c„, T) under the null hypothesis. Note that under 

the null hypothesis ^ = 0 it holds Yi 'A?' J\f (Q, (Jq) and thus due to independence for 77 > 0 


E,i=o,A=(To — (1 + 


2\-') 


mT,\ 


E;i— o,a=(7o 


I ^’n^i Y 2Yi/S.n — AH 

exp ?7 ^ 


i:-eA„ 


= (1 


.2\-V 


^Unl 


= (1 + .^)-’^^ 


^E/2=0,A=cro 

n E 

i-A&Ar^,i 


n 

i‘. ^ GAn.i 


2{1 + Kl)a^ 


,IY^ + 2y,A„ - Al 

2{1 + Kl)a^ 


exp 77 


2^2 y2 




A2 + 2aoAA„ - Al 


2(1 + kI) erg 


= (1 


2 \-’n 


AIr^\ 


E 


exp 77 


2 ^ 2 . y2 




+ 2anXAn — Al 


2 (1 + 4) erg 


n|/„| 


(43) 
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where X ~ A/'(0,1). Thus we need to calculate the expectation of 


exp 77 


+ 2cro^A„ — 


2{1 + Kl)a^ 


where we abbreviated 


A = 


= exp - 


2{1 + kI) 
A„ 


20-2 


y • exp 


(r7s(A + Ar 


aoKl' 


(44) 

(45) 

(46) 


The right-hand side in (44) corresponds to the Laplace transform of a non-central distributed 

exp ( 1 ^) 


random variable given by 


E 


exp 


{t {x + xy 


if A S K, t < 1/2, otherwise the Laplace transform does not exist. Note that t = rjs < 1/2 if and 
only if 77 < 1 -I- 1/k^). In this case the expectation is given by 


E 


exp 77 




A2 + 2aoXA„ - Al 


2(1 + kI) erg 


= exp 


2al^l 


\/l + 


E 


exp 


V2CTg(l + (l-77)K2)y ■ 


( 77 s (A + A) 
^ - 1 ) 


By (431 this implies 

^fi=0,X=tTo [Ln] 


= (1- 

(1 






"v/l + 


v /1 + (1 - 77 ) kI V 2 o-g (1 + (1 - v) kI) 

( 77 ( 77 -l)n|/„| A 2 




n|/„| 


(1 + (1_^) 4)^ ''''''V2fTg(l + (l-r7)/t2); ■ 

To prove ( [4^ we now note for any n such that > 1/^ as > 0 we have 
1^71=0,A=cro [l{|iTi-l|>Jin} l-^n ~ 1|] 

00 

= J ]P 7 t= 0 ,A=cro > a; + 1) da; -f C^n]P 7 t= 0 ,A=(To {^n + + 1) 

ilr. 

oo 

< J PT^^O.A^ao (-frn > a;) da; -|- ^^„P7t=0,A=cro {Ln > yin) 


—®M=0,A=tTo 

— p7i=0,A=tTo 


(Ln)"^' {yin)~' 


= exp 


?a-|l^n,| A (1 -|-(5) (5 r*a'|l^r!,|, 2\ A 2\ , r i nr |\ ri reN 

log(l-f K„)-— log(l-(5K„) -b^log(|/„|)-(51og(0 


2crg 1 - SkI 


(48) 


Now, for (48 1 to tend to 0, the exponent must converge to —oo. Consequently, the weak law of 
large numbers is applicable and finally (14) tends to 0 if (151 holds true. □ 
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Proof of Theorem 3.3. Recall that the test statistic is given by 


Tn{y) = sup S{A„), 


with the inner part 


S(/U:=^ E r, + ^ 




A, 


We readily see that S [An) obeys the following distributions 

Under Hq : S[An) ~ xl\i„\ {, 

Under H'f : S[In) {^ + kI) xl\i„\ ({^ + ^1) ■ 

Here /„ in the alternative denotes the true position of the jump. 

Using (|36b|) with A: = 1, 6i = 1, di = n |/„| and a? = we find 

'-' ^O'^n 

Fho { sup S[In) > c*,„) <7^P//o {S[In) > C* ) 

J Fn\ 


I4|' 

<a. 


^2 

^n\In\ 


n 


141 An 


> 


For the type II error denote 


yn,a. — ^n) ^ ^n) 


2 n 


141 A^ 


rr2K-4 


— 2(1 + Kn)\ n |/„| + 2 (1 + 


^|4| A2 

n-2K4 


log-. 

a 


Then we calculate using (36a) with fc = 1, 6i = (l + ^n)i di = n |/„| and = (l + 
that 


2\ n|/„|A^ 


sup ( sup 5'(H„) < ) < sup inf Pg^ (S'(H„) < y„,„) 

Bn&Art \An&A„ J B„G.4„ G 

< sup Pb„ [S[Bn) < yn,a) 

B„&Ar, 


=P 



+ ^n) Xn|/„| 


^|4| An 

O- 0 «n J 



<a. 

Thus the claim is proven if yn^a A c* i.e. 


(1 + ^V) ^ |4| + (l + ^ 2 L " ~ |4| +2(1 + k'^) 

nn V V 


^I4| A2 

rr2K4 


log ( I 


>n |/n I + 


-|4|A^ 

rr2ft-4 


-21og(a|/„|) + 2Wn|/„| 1 + 2 


A2 

rr'^K^ 


log 


141 ;■ 


But it can be easily seen by rearranging terms and multiplying by that this is equivalent to 

(fTol). □ 
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C. Proofs of Section [4] 


If 0 and / or 0, Taylor’s formula can be used to simplify some of the terms in (15) 

which we will use in the following: 


(1 + 5n) 5„ 
1 - 5nHn 


— Sn + S'^ [l + (l + K^) + + O , 


5lf 


(49a) 

(49b) 

(49c) 


Proof of Theorem \4.S\ Note that (5„ < ^ is satisfied for free here as the right-hand side diverges 
or is constant and —>■ 0. Furthermore in both cases the assumption n |J„| A„ = O (log (|/n|)) is 
satished as well. Thus we can apply Theorem |3.2[ 


- log (1 - 5^kI) = S^kI + ^ + 0 , 

— ^n log (1 + K^) = —^n^n ^ ^ ('^n^n) ■ 


1. Under the assumptions of this part of the theorem, we may use (49a)-(49c) and insert 
= cIS.nl oq (1 -I- o(£„)) to find that (15) is satisfied if 


2^0 


1 + - +log(|/„|) 


\In\Al 

2^2 


^1 + + O (^SnSnn \In\ ^ —OO. 

(50) 


Here we used that A„ = o(en)- Then with := Ug ^e„ and using (21al we obtain 


n\In\Al c‘ 


2a2 


and furthermore 


1-f 


■log(l-fnl) 


= O-Q log (141) (l- {C - Enf C 
= 2 cro'^C 7"42 log(|4|) - log(|4| 


^|4|A^ 

2cr2 


1 + T ) + 20-0 log (|4|) 


=a^^el log (141) (-ag-i (C - Enf 4-2 + 2C-^) 

= (cto-2 (^2T4 -i)eI + 0 {eD) log (141). 

It can be readily seen that Ci := ^/2 + — 1 > a/S — 1 > 0 for all c > 0. Thus under (21a I 


we have 


4 


n\In\Al / ^ c 


2a, 


2 ^ 1 + Y ) + log (|4|) 


c2«|4|A2 ! C- 

" 2al ' 


= (Cie^ + o (4)) log (|/„|) ^ -OO 
as n —)• OO. Thus is satisfied. 


2. Here only (49a) and (49b I can be applied. Thus (15) is satisfied if 

AS 


4 


n 4 


1 

2al ~ 2 y +o(£n) ) +log(|4|) 


-f4«l4 


AS 




(51) 


+ 0 (4^141) -OO 


24 
































Inserting the assumption A„ = ^ (1 + o(£„)) 
with ?7 > 0 to be chosen later we find 


(1 + 0 (£„)), (21bI and (5„ := 77£„ 


' ~ 2 + ^^) + “^ + o{£n) ) + log (|/„ 


=77£„ log(|/„|) 

=2r;C-i£2 log (|/„|) (1 + 0(1)) 


1 - (C - £nf ( ^ ^ log (1 + K^) ) + O (£„) 


1 


and 


5n'n\In\ (l + '^^) + ^ + O (^n) +277(7 ^£^ log ( |/„ |) (1 + O (1) ) + (7 ((5^71 |/„ |) 
=rfel log (|/„|) (^277-1(7-1 - (C - ^nf (l + + ^ ^ + o (1)^ 

=rfel log (141) ^2?7-1(7 -i - (l + k^) + '^^ + o (1)^ ^ -oo 

(l + '«^) + 


if we choose e.g. rj ^ = C' 


. Thus (15) is satisfied. 


□ 


Proof of Theorem\4.1\ 1- This follows directly from Theorem |4.2| with c = Cn —> 0. 


2. Similarly as in the proof of Theorem 4.2 we have to show that (511 is satisfied. Therefore 
note that with 4 := T/£n with 77 > 0 to be chosen later we have 

l-^nl ~ log (1 + ^^) + ^ + O (^n)^ + log (141) 


1- (v^O-o-£„) ^ ("y -log(l + K2)^ +o(£n) 


=77£„l0g(|4|) 

=V2r]aQ^£l log (|4|) (1 + 0 (1)) 
by the assumption that 1/A^ = o{£n)- Furthermore 
'A^ 


S^nlln] 




+ 2 / 2770-0 ^el log(|4|) (1 + 0(1)) 


=7742 log (|4|) (V2?7 4 o ^ - (1 + + 

if we choose e.g. 77 = (l + 4) ^. This proves the claim. 


□ 


Proof of Theorem 4-3 Note again that 6n < -^ is satisfied for free here as the right-hand side 
diverges or is constant and 4 0. Furthermore in both cases the assumption ?r|4|A„ = 


O (log (|7n|)) is satisfied as well. Thus we can apply Theorem 3.2 


1. Under the assumptions of this part of the theorem, we may again use (49al-(49cl to find 
that dl^ is satisfied if 


+77 141'f+log(|4|))+^^ 


7714| Al ,, , k; 


2^2 


^In 


+0{6n£n l0g(|4|)) -t -00 
(52) 
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Here we used that = o{sn)- By ^ we have under (22al with <5^ •= 

that 




—*^0 


^Iu\Al 

2a^ 


1 + 


202 


log (141) 


log (i 7 „i) (i - { 1 +oy) 

= - 2en log (|/„|) — + f \el log (|4|) - ^el log (|/„|) 
0*0 V'^o 


4"" 


=4log (141) ^ +o(l) , 


where we used 0 ^ = o(e„). Furthermore 


,2 fn \In\Al l4 

2al + ' 4 


+ 4l0g(|4|) ^ + 0 ( 1 ) +0(4£nl0g(|4|)) 


=ao-44og(|4|) (^2-Q 

= 4 log(| 4 |)(l + o(l)) ^ -oo. 


2 + 4 ) + o(l) 


Thus (52) and hence (15) is satisfied. 


2. Similarly as in the proof of Theorem 4.2 we have to show that (51) is satisfied. Therefore 
note that with 4 := r\En where r] > 0 will be chosen later we have 

l^"l (~ 2 + ^^) + “4 + o(£n)'j + log (|4 


=rjen log (141) 


1 + (C - enf ^2 1°§ (l^ + - 


o(4 log(|4|)) 


=24 44log(|4|) (1+ o(l)) 
by the assumption that = o (e„). Furthermore 


5in\In 


4 

+ — 


” ' _2a2 ' 4 


24-44 log (141) (1 + 0(1)) 
2 


=0 4 log (141) 24 r] -(4-e„) —+o(l) 


if we choose e.g. rj 


4^y, which proves the claim. 


□ 


D. Proofs of Section 


Proof of Theorem 5.2 Due to Theorem 3.3 we only have to show that under the assumptions of the 
theorem the condition (19) is satisfied as n —>■ 00. The given situation is = cA„/(To (1 + o (£«)). 
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Inserting this into ( |19[ ) we have to show that 
/'„2 


\^ri\ I 

2 w C r, 


. 2cA„ 

>-log 

CTo 


+ 2 1 + 


O’o ^0 


a\I, 

cAyj 

ctq 


+ 24 


^^^^(2 + c2)log 


/n|/„| A2 


O-Q 


c2 + 2 + 


2cA^ 

CTo 


log ( - 


a 


as n —)■ oo. Inserting (p^ and dividing by — log (|/^|) yields that 


(c2 + 2) 


>- 


2cA„ 

CTo 


1 + 


+ 2 1 + 


logo 

log (141) 

cAj^ 

CTo 


O^O CTg 

+ 2 


(C + Sn) 


(2 + c2) 


log(Q) + log(|4|) 
log (141) 


2cA„\ (C + e^)^ log (a) 


CTo 


erg log(|4|) 


is sufficient as n —^ oo. The deffnition of C and some straightforward calculations using A„ = o (Sn) 
and loga/log(|4|) —t 0 show that is enough to prove 


4v/c^42 c2 + 2 


£n H-2—> 4 + 


242442 


Sn + I 


I log (a) 442+42 

log(|4|) cTo 


I log (a) 
log(|4|)' 


□ 


(To (Tg (Tg 

which directly follows from e„4“ log (|4|) —t c». 

Proof of Theorem \5.1\ 1. This follows directly from Theorem |4.2| with c = Cn —> 0. 

2. Inserting the assumptions and dropping all lower-order contributions, we ffnd with k := cr/erg 


that for (19) the following is sufficient as n —>■ cx): 


2 ) £+ 1 + > 2 ,. log (+) + 

4 VI4|/ erg y 


1 

14 


242 (1 + k)^/^ 


Inserting ([^ and dividing by — log (|4|), we see that it is enough to prove 


2) > 24 + 


24^ (42(70+ £„) + 24^(1 ^ ^^3/2 (42ao +£ 4 /log^^ 
CTo erg y log(|4|) 


But this condition is true due to our requirements on 

□ 

Proof of Theorem \5.3[ Due to Theorem |3.3| we only have to show that under the assumptions of the 
theorem the condition (19) is satisfied as n —t oo. The given situation is = A„/((Tg0„) (1 + o (£„)). 
Inserting this into (19) we have to show that 


1 \ n|4|A2 A„ n|4|A2 2A, 


02 

^•n 


(Tg^y, 


>^log 

eroUn 


2 1 + 


141 A2 


02 


log 


1 

er |4 


ero0^ 


/n|4| A2 


ero 


24+“" 


02 CTo0,i 


log ( - 
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as n —>■ 00 . Inserting (28) and dividing by — log (|/„|) yields that 

(2ao + g„f ^202 ^ ^ (2ao + £n)" g^ >2 (20^ + l) 


log (a) +log(|/„|) 


CTo 


+ 2^ 


( 2 (Jo + £n) 


(20^ + 1) 


logd^nl) 

log (a) 

l0g(|/n|) 


is sufficient as n —> oo where we used that lS.nlQn ~ cTn = o (£n)- Some straightforward calculations 
and skipping all terms of order o(e„) shows that is enough to prove 


4 H-£„ > 4i 

O-Q 1 


'log(a)+ log(|/„|) 2£„ /log(a)+log(|/„|) 


log(|/n|) 


CTO 


log(l-fnl) 


I log (a) 2 £„ / log(a) 

log(|/„|) (To V log(l^"l)' 


Employing Vo + < -x/a + we hnd the sufficient condition 


2 

(^0 


log (a) 


□ 


, l0g(|/n|) 

as n —)■ 00 , which directly follows from en\/— log (|/n|) —t oo. 

Proof of Theorem \5.4\ For An G An let us abbreviate 

Ya„ := {n\In\)-^ 

In our model where the li’s are independent Gaussian, it follows from Cochran’s theorem that 
obeys a Xn\A |-i distribution and Ya„ an independent Xi distribution. Now recall that the 
test statistic is given by 

TniY) = sup S{An), 

An&An 


with the inner part 


S{An) := 


l^l^llnl ^9 . n\In\ ^2 


+ 1) 


Si 


{Ya 


Using the above results, we readily see that S {An) obeys the following distributions 


Under Hq : S{An) 


kI + 1 


Xh7„|-i (0)+X? (0), 


Under H'f : S{In) Klxl\in\-i (0) + («^n + l) X? f 2 

\Oo U + Kn) / 

Here In in the alternative denotes the true position of the jump. 

For simplicity denote by xl a chi-squared random variable with j degrees of freedom and by ^ a 


standard normal variable independent of xl- Now applying Lemma (36b) with k = 2, bi = -^ 


di = n \In\ — 1, 62 = 1, ^2 = 1 and oi = 02 = 0 we get 

Pffo ( sup <7^Pffo (5'(H„) > c*_„) 

VA„G.4„ ' / Pn| 


K^ + 1 ’ 


I 

<a. 
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Let us turn to the type II error. We will apply (36a) with k = 2, bi = di = n|/„| — 1, 
62 = + 1, fi 2 = I, «! = 0 and a| = 


^2 = ■ Denote 




K^n ]/«I + 2k^ + I + 


2(1 + k 2 )A>|/„| 


log-. 

a 


We have 


B„eAr, 


A„eAr, 


sup Pb„ sup S{Ari) <yn,a] < sup inf V’Br,iS{An) <yn,a) 


Bn&An ^neA„ 

< sup Pb„ (5'(B„) < yn,a) 

Bn&Aru 


=r K~ 



^ yn,o 


yn,a 


<a. 


Thus to find the detection boundary conditions we need to investigate the inequality yn,a > c’^ a- 


2 I 7- I I 1 I \In\ r, 

\In\ + 1 H--2^ 


U, 


0 


B-nB |dn| + 2^2 + I + 


2(l + K2)A2n|J„| 


log- 

a 


> 


K^n|/„| + 1 |/„| + 2^2 + 1 / 1 


+ 2 




log 


nl + 1 

First of all we can rewrite this inequality as follows, 

>4iB\In\ , Yi A2n|/, 


\^n I 


- 21 og (a |/„|). 


k2 + 1 ^2 + I 

> 2 


- 2 , 


"0 


B-nB \In\ + 2^2 -)- 1 + 


2(l + «;2) A2n|/„| 


log- 

a 


<71 |/„| + 2 ^ 2+1 

« + lf 


I 


'\In\ 


- 21 og(a|J„|). 


(53) 


Now we analyze it in the three regimes as usual: 

• Dominant mean regime: Inserting /c^/A„ = o(en) and ignoring all o(e„)-terms in the 
following, we Hnd that it is sufficient to prove 


A 2 n|/„| 


I 


> 21 og(p^) + 2 V 2 


'A>|/„ 


/ log ( - 


Inserting (331 and dividing by — log (|/n|) we find that the above is the case if 


2v^crn ^Sn > 4. 


/ log(Q:) 

log(|/n|) 


which is true by assumption. 
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• Equilibrium regime: Inserting K^jAn = o{£n) and ignoring all o(£„)-terms in the following, 
we find that it is sufficient to prove 


\In 


(1 + c^) 


>„ 0 g| + 


'log 1 - ) + 2ci 




'log 


:|/n 


Inserting (341 and dividing by — log {\In\) we find that the above is the case if 


I + 




C" +2- 


-CSr, 


>2 + 2C- + -£„+ ^^^ + ^"^ 

(Tq (To (To 




+ 2 + c 


I log ((3:) 
log (141) 


which is true by the definition of C and our assumption on e„. 

• Dominant variance regime: Inserting = A„/((To0n) and ignoring all o(e„)-terms in the 
following, we find that it is sufficient to prove 


14 

^2 


1 + 


I 


^ 21og ( n-f ) + 2^/ ( 2 + ^ 


1 \ Alnlln 




2 /A2n|4| 


'log 


a |4 


Let (7 = 1 + -v/S. Inserting (351, dividing by — log (|4|)) and using 9^ = o (£„) we find that 
the above holds true if 


42 + 2-^£„ > 2 + 24 + (24 + 24^) J ^ 

^0 V^°s(l4|) 

which is true by the definition of 4 and our assumption on £„. 


□ 
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